Tamhane and Bechhofer (1977, 1979) studied in depth a two-stage elimination
type procedure ($P_2$) for selecting the largest normal mean when the common variance
is known. In order to determine a set of constants necessary to implement $P_2$, they
proposed a criterion of minimizing the maximum over the entire parameter space of
the expected total sample size required by $P_2$ subject to the procedure's guaranteeing
a specified probability of a correct selection. As a consequence, $P_2$ based on this
unrestricted minimax design criterion possesses the highly desirable property that
the expected total sample size required by $P_2$ is always less than or equal to the
total sample size required by the best competing single-stage procedure of Bechhofer
(1954), regardless of the true configuration of the population means.

The logistic distribution has been widely used by Berkson (1944, 1951, 1957) as a
model for analyzing experiments involving quantal response. Pearl and Reed (1920)
used it in studies connected with population growth. Plackett (1958, 1959) has
considered the use of this distribution with life test data. Gupta (1962) has studied
this distribution as a model in life testing problems.

The importance of the logistic distribution in the modeling of stochastic phenomena
has resulted in numerous other studies involving probabilistic and statistical
aspects  of  the  distribution.  For  example,  Gumbel  (1944),  Gumbel  and  Keeney  (1950) 
and  Talacko  (1956)  show  that  it  arises  as  a  limiting  distribution  in  various  situations; 
Birnbaum  and  Dudman  (1963),  Gupta  and  Shah  (1965)  study  its  order  statistics. 
Many  other  authors,  for  example,  Antle,  Klimko  and  Harkness  (1970),  Gupta  and 
Gnanadesikan  (1966)  and  Tarter  and  Clark  (1965),  investigate  inference  questions 
about  its  parameters. 

In this paper we consider an elimination type two-stage procedure for selecting
the logistic population with the largest population mean when the populations have
a common known variance. Using an approximation to the distribution of the sample
means from a logistic population, we propose a two-stage elimination type procedure
$P_2$ and a non-linear optimization problem, based on a minimax criterion, to find a set
of constants needed to implement $P_2$. We derive lower bounds of the probability of a
correct selection and the infimum over the preference zone of the lower bounds. We
determine the supremum of the expected total sample size needed for $P_2$ over the
whole parameter space. We provide tables of the constants needed to implement $P_2$ and of
the efficiency of $P_2$ relative to the corresponding single-stage procedure $P_1$ for the two
special cases of the equally spaced and slippage configurations.
2 Distribution of logistic sample means


Because of the similarity between the logistic and the normal distributions, the sample
mean and variance, the moment estimators of the logistic population parameters,
are effective tools for statistical decisions involving the logistic distribution. Antle,
Klimko and Harkness (1970) give a function of the sample mean as a confidence
interval estimate of the population mean when the population variance is known.
Schafer and Sheffield (1973) show that in terms of the mean squared error the moment
estimators of the logistic population parameters are as good as their maximum
likelihood estimators. The fact that the distribution of a sample mean has monotone
likelihood ratio (MLR) with respect to the population mean when the variance
is known is used by Goel (1975) to obtain a uniformly most accurate confidence
interval for the population mean and a uniformly most powerful test for one-sided
hypotheses involving the population mean. The sampling distribution of the mean is
a primary requirement for these statistical purposes. The papers by Antle, Klimko
and Harkness (1970) and Tarter and Clark (1965) used a Monte Carlo method for
this distribution.


Goel (1975) obtained an expression for the distribution function of the sum of
independent and identically distributed (iid) logistic variates by using the Laplace
transform inversion method for convolutions of Pólya type functions, a technique
developed by Schoenberg (1953) and Hirschman and Widder (1955). He provides a table
of the cumulative distribution function (cdf) of the sum of iid logistic variates for the
sample sizes n = 2(1)12, x = 0(0.01)3.99 and n = 13(1)15, x = 1.20(0.01)3.99. George
and Mudholkar (1983) obtained an expression for the distribution of a convolution of
iid logistic variables by directly inverting the characteristic function. However,
since both formulas of Goel (1975) and George and Mudholkar (1983) contain a term
$(1 - e^{x})^{-k}$, $k = 1, \ldots, n$, a problem of precision of the computation at values of
x near zero arises when n is large. George and Mudholkar (1983) also show that
a standardized Student's t distribution provides a very good approximation for the
distribution of a convolution of iid logistic random variables.

In this section, we consider an approximation problem for the distribution of a
standardized mean of samples from a logistic population by using Edgeworth series
expansions. It can be shown that this is a far better approximation than the Student's
t distribution suggested in George and Mudholkar (1983), and hence this
approximation will be used henceforth.

2.1  Logistic  distribution 

A random variable $X$ has the logistic distribution with mean $\mu$ and variance $\sigma^2$,
sometimes denoted by $L(\mu, \sigma^2)$, if the probability density function (pdf) of $X$ is
given by

$$f(x) = (g/\sigma)\,\exp\{-g(x - \mu)/\sigma\}\,[1 + \exp\{-g(x - \mu)/\sigma\}]^{-2} \tag{1}$$

and the cdf of $X$ is defined by

$$F(x) = [1 + \exp\{-g(x - \mu)/\sigma\}]^{-1}, \tag{2}$$

where $-\infty < x < \infty$, $-\infty < \mu < \infty$, $\sigma > 0$ and $g = \pi/\sqrt{3}$. This distribution is
symmetrical about the mean $\mu$.

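Letting $Y = (X - \mu)g/\sigma$, the random variable $Y$ has the logistic distribution with
mean zero and variance $\pi^2/3$. The pdf and cdf of the random variable $Y$ are given
by

$$f(y) = \exp\{-y\}\,[1 + \exp\{-y\}]^{-2} \tag{3}$$

and

$$F(y) = [1 + \exp\{-y\}]^{-1} \tag{4}$$

respectively, where $-\infty < y < \infty$. (3) may be written in terms of $F(y)$ as

$$f(y) = F(y)(1 - F(y)). \tag{5}$$

The moment generating function (mgf) of $Y$ is given by

$$M_Y(t) = \Gamma(1 + t)\Gamma(1 - t) = \pi t/\sin \pi t, \quad |t| < 1. \tag{6}$$

We can also express (6) as

$$M_Y(t) = \sum_{j=0}^{\infty} (-1)^{j-1}\,\frac{2(2^{2j-1} - 1)}{(2j)!}\,B_{2j}\,(\pi t)^{2j}, \tag{7}$$

where the $B_\nu$'s are Bernoulli numbers defined as

$$z/(\exp(z) - 1) = \sum_{\nu=0}^{\infty} B_\nu z^{\nu}/\nu!. \tag{8}$$

The $\nu$th cumulant of $X$, $\kappa_\nu(x)$, is defined through the expansion
$\log \varphi_X(t) = \sum_{\nu=1}^{\infty} \kappa_\nu(x)(it)^{\nu}/\nu!$,
where $\varphi_X(t)$ is the characteristic function of the random variable $X$, and the $\nu$th
relative cumulant of $X$, $\lambda_\nu(x)$, is defined by

$$\lambda_\nu(x) = \kappa_\nu(x)(\kappa_2(x))^{-\nu/2}.$$

Using the moments of $X$ and the definition, the first few of the $\nu$th relative
cumulants of $X$ are given by

$$\begin{aligned}
\lambda_1(x) &= \mu/\sigma, \\
\lambda_2(x) &= 1, \\
\lambda_4(x) &= 6/5, \\
\lambda_6(x) &= 48/7, \\
\lambda_8(x) &= 432/5, \\
\lambda_{10}(x) &= 145152/77, \\
\lambda_{2j+1}(x) &= 0, \quad j = 1, 2, \ldots
\end{aligned} \tag{9}$$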
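The even-order values in (9) can be checked with exact rational arithmetic. The sketch below is not taken from the paper: it assumes the standard identities $\kappa_{2m} = 2(2m-1)!\,\zeta(2m)$ for the logistic distribution with unit scale and $\zeta(2m) = |B_{2m}|(2\pi)^{2m}/(2(2m)!)$, which together give $\lambda_{2m} = |B_{2m}|\,12^{m}/(2m)$. The function names are illustrative only.

```python
from fractions import Fraction
from math import comb


def bernoulli_numbers(m_max):
    """Return [B_0, ..., B_m_max] for the convention z/(exp(z)-1) = sum B_v z^v / v!."""
    B = [Fraction(1)]
    for m in range(1, m_max + 1):
        # Standard recurrence: sum_{k=0}^{m} C(m+1, k) * B_k = 0 for m >= 1.
        acc = sum(comb(m + 1, k) * B[k] for k in range(m))
        B.append(-acc / (m + 1))
    return B


def logistic_relative_cumulant(order, B):
    """Relative cumulant of the logistic distribution for order >= 2.

    Even orders use |B_order| * 12**(order/2) / order (assumed identity, see lead-in);
    odd orders >= 3 vanish by symmetry.
    """
    if order % 2 == 1:
        return Fraction(0)
    return abs(B[order]) * Fraction(12) ** (order // 2) / order


B = bernoulli_numbers(10)
for order in (2, 4, 6, 8, 10):
    print(order, logistic_relative_cumulant(order, B))
```

Because the computation uses `Fraction`, the results can be compared with the fractions in (9) without rounding concerns.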

2.2  Edgeworth  series  expansions  for  the  distribution  of  the 
mean  of  samples  from  a  logistic  population 

Let $X_1, X_2, \ldots, X_n$ be a random sample of size $n$ from a logistic population $L(\mu, \sigma^2)$
with mean $\mu$ and variance $\sigma^2$ whose pdf and cdf are given in (1) and (2) respectively.
Define a standardized mean of samples of size $n$ from $L(\mu, \sigma^2)$, $Z$ say, as

$$Z = \frac{\sqrt{n}}{\sigma}(\bar{X} - \mu), \tag{10}$$

where $\bar{X} = \frac{1}{n}\sum_{i=1}^{n} X_i$ is the sample mean.

Let $f_n(z)$ and $F_n(z)$ denote the pdf and cdf of the standardized mean of samples
of size $n$ from $L(\mu, \sigma^2)$. Then the Edgeworth series expansions of $f_n(z)$ and $F_n(z)$
are given symbolically as

$$f_n(z) = \phi(z)\Big\{1 + \sum_{j=1}^{\nu} p_j(z)\,n^{-j/2}\Big\} + O(n^{-(\nu+1)/2})$$

and

$$F_n(z) = \Phi(z) - \phi(z)\sum_{j=1}^{\nu} P_j(z)\,n^{-j/2} + O(n^{-(\nu+1)/2})$$

respectively, where $\phi(z)$ and $\Phi(z)$ are the standard normal pdf and cdf respectively
and $p_j(z)$ and $P_j(z)$ are polynomials in $z$, which are obtained up to $\nu = 10$ in Draper
and Tierney (1973).

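Using $p_j(z)$ and $P_j(z)$ from TABLE II of Draper and Tierney (1973) and the
relative cumulants of $X$ given in (9), the Edgeworth series expansions of $f_n(z)$
and $F_n(z)$ correct to order $n^{-3}$ are given by

$$\begin{aligned}
f_n(z, \nu = 6) = \phi(z)\Big\{1
&+ \Big[\tfrac{1}{4!}\big(\tfrac{6}{5}\big)H_4(z)\Big]n^{-1} \\
&+ \Big[\tfrac{1}{6!}\big(\tfrac{48}{7}\big)H_6(z) + \tfrac{1}{2!}\big(\tfrac{1}{4!}\big)^2\big(\tfrac{6}{5}\big)^2 H_8(z)\Big]n^{-2} \\
&+ \Big[\tfrac{1}{8!}\big(\tfrac{432}{5}\big)H_8(z) + \tfrac{1}{4!}\,\tfrac{1}{6!}\big(\tfrac{6}{5}\big)\big(\tfrac{48}{7}\big)H_{10}(z)
 + \tfrac{1}{3!}\big(\tfrac{1}{4!}\big)^3\big(\tfrac{6}{5}\big)^3 H_{12}(z)\Big]n^{-3}\Big\} + O(n^{-7/2})
\end{aligned} \tag{11}$$

and

$$\begin{aligned}
F_n(z, \nu = 6) = \Phi(z) - \phi(z)\Big\{
&\Big[\tfrac{1}{4!}\big(\tfrac{6}{5}\big)H_3(z)\Big]n^{-1} \\
&+ \Big[\tfrac{1}{6!}\big(\tfrac{48}{7}\big)H_5(z) + \tfrac{1}{2!}\big(\tfrac{1}{4!}\big)^2\big(\tfrac{6}{5}\big)^2 H_7(z)\Big]n^{-2} \\
&+ \Big[\tfrac{1}{8!}\big(\tfrac{432}{5}\big)H_7(z) + \tfrac{1}{4!}\,\tfrac{1}{6!}\big(\tfrac{6}{5}\big)\big(\tfrac{48}{7}\big)H_9(z)
 + \tfrac{1}{3!}\big(\tfrac{1}{4!}\big)^3\big(\tfrac{6}{5}\big)^3 H_{11}(z)\Big]n^{-3}\Big\} + O(n^{-7/2})
\end{aligned} \tag{12}$$

respectively, where the $H_j(x)$'s are the Hermite polynomials of degree $j$, which are defined
by

$$\Big(\frac{d}{dx}\Big)^{j} \exp(-x^2/2) = (-1)^{j} H_j(x)\exp(-x^2/2), \quad j = 0, 1, \ldots$$

The Hermite polynomials follow the recurrence relation

$$H_j(x) = xH_{j-1}(x) - (j - 1)H_{j-2}(x), \quad j = 2, 3, \ldots,$$

and are given in TABLE III in Draper and Tierney (1973) for j = 1(1)30.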
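The expansions (11) and (12) are straightforward to evaluate once the Hermite polynomials are generated by the recurrence above. The following is a minimal sketch, not the authors' program; the function names are illustrative, and the coefficients are simply $\lambda_4/4!$, $\lambda_6/6!$ and $\lambda_8/8!$ with the relative cumulants from (9).

```python
import math


def hermite(j, x):
    """Hermite polynomial H_j(x) via H_j = x*H_{j-1} - (j-1)*H_{j-2}, H_0 = 1, H_1 = x."""
    if j == 0:
        return 1.0
    h_prev, h = 1.0, x
    for m in range(2, j + 1):
        h_prev, h = h, x * h - (m - 1) * h_prev
    return h


def phi(z):
    """Standard normal pdf."""
    return math.exp(-0.5 * z * z) / math.sqrt(2.0 * math.pi)


def Phi(z):
    """Standard normal cdf."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))


# lambda_4/4!, lambda_6/6!, lambda_8/8! for the logistic distribution, from (9).
A4 = (6 / 5) / math.factorial(4)
A6 = (48 / 7) / math.factorial(6)
A8 = (432 / 5) / math.factorial(8)


def edgeworth_pdf(z, n):
    """Approximate pdf of the standardized logistic sample mean, as in (11)."""
    t1 = A4 * hermite(4, z)
    t2 = A6 * hermite(6, z) + 0.5 * A4 ** 2 * hermite(8, z)
    t3 = A8 * hermite(8, z) + A4 * A6 * hermite(10, z) + A4 ** 3 / 6 * hermite(12, z)
    return phi(z) * (1.0 + t1 / n + t2 / n ** 2 + t3 / n ** 3)


def edgeworth_cdf(z, n):
    """Approximate cdf of the standardized logistic sample mean, as in (12)."""
    t1 = A4 * hermite(3, z)
    t2 = A6 * hermite(5, z) + 0.5 * A4 ** 2 * hermite(7, z)
    t3 = A8 * hermite(7, z) + A4 * A6 * hermite(9, z) + A4 ** 3 / 6 * hermite(11, z)
    return Phi(z) - phi(z) * (t1 / n + t2 / n ** 2 + t3 / n ** 3)
```

Since only odd-degree Hermite polynomials enter (12), the approximate cdf is symmetric about zero, as the exact distribution is.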

3 An elimination type two-stage procedure for selecting
the best population

3.1  Preliminaries 

Let $\pi_i$, $i = 1, \ldots, k$, denote $k$ logistic populations with unknown means $\mu_i$ and a
common known variance $\sigma^2$, and let

$$\Omega = \{\mu = (\mu_1, \ldots, \mu_k) : -\infty < \mu_i < \infty,\ i = 1, \ldots, k\}$$

be the parameter space. Denote the ranked values of the $\mu_i$ by

$$\mu_{[1]} \le \cdots \le \mu_{[k]}$$

and let

$$\delta_{ij} = \mu_{[i]} - \mu_{[j]}.$$

We assume that the experimenter has no prior knowledge concerning the pairing
of the $\pi_i$ with the $\mu_{[j]}$, $i = 1, \ldots, k$, $j = 1, \ldots, k$. Let $\pi_{(j)}$ denote the population
associated with $\mu_{[j]}$.

The goal of the experimenter is to select the 'best' population, which is defined as
the population with the largest mean. This event is referred to as a correct selection
(CS). The experimenter restricts consideration to procedures ($P$) which guarantee
the probability requirement

$$P_\mu[CS|P] \ge P^*, \quad \forall \mu \in \Omega(\delta), \tag{13}$$

where $\delta > 0$ and $1/k < P^* < 1$ are specified prior to the start of experimentation
and

$$\Omega(\delta) = \{\mu \in \Omega \mid (\mu_{[k]} - \mu_{[k-1]}) \ge \delta\},$$

which is defined as the preference zone for a correct selection.

Here we propose an elimination type two-stage procedure $P_2 = P_2(n_1, n_2, h)$ which
depends on non-negative integers $n_1$, $n_2$ and a real constant $h > 0$ which are determined
prior to the start of experimentation. The constants $(n_1, n_2, h)$ depend on $k$,
$\delta$ and $P^*$, and they are chosen so that $P_2$ guarantees the probability requirement (13)
and possesses a certain minimax property.

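Procedure $P_2$:

Stage 1: Take $n_1$ independent observations

$$X_{ij}^{(1)}, \quad j = 1, \ldots, n_1,$$

from $\pi_i$, $i = 1, \ldots, k$, and compute the $k$ sample means

$$\bar{X}_i^{(1)} = \frac{1}{n_1}\sum_{j=1}^{n_1} X_{ij}^{(1)}.$$

Let $\bar{X}_{[k]}^{(1)} = \max_{1 \le j \le k} \bar{X}_j^{(1)}$. Determine the subset $I$ of $\{1, \ldots, k\}$ where

$$I = \{i \mid \bar{X}_i^{(1)} \ge \bar{X}_{[k]}^{(1)} - h\sigma/\sqrt{n_1}\},$$

and let $\pi_I$ denote the associated subset of $\{\pi_1, \ldots, \pi_k\}$.

1. If $\pi_I$ consists of one population, stop sampling and assert that the population
associated with $\bar{X}_{[k]}^{(1)}$ is best.

2. If $\pi_I$ consists of more than one population, proceed to the second stage.

Stage 2: Take $n_2$ additional independent observations $X_{ij}^{(2)}$, $j = 1, \ldots, n_2$, from
each population in $\pi_I$, and compute the cumulative sample means

$$\bar{X}_i = \frac{1}{n_1 + n_2}\Big(\sum_{j=1}^{n_1} X_{ij}^{(1)} + \sum_{j=1}^{n_2} X_{ij}^{(2)}\Big)
 = \frac{n_1 \bar{X}_i^{(1)} + n_2 \bar{X}_i^{(2)}}{n_1 + n_2}$$

for $i \in I$, where

$$\bar{X}_i^{(2)} = \frac{1}{n_2}\sum_{j=1}^{n_2} X_{ij}^{(2)}.$$

Assert that the population associated with $\max_{i \in I} \bar{X}_i$ is the best.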
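The two stages above can be sketched as a small simulation. This is an illustrative sketch only: the function names are hypothetical, the logistic variates are drawn by inverting the cdf (2), and the design constants passed in the usage line are arbitrary values, not entries from the tables.

```python
import math
import random


def logistic_sum(mu, sigma, n, rng):
    """Sum of n draws from L(mu, sigma^2), generated by inverting the cdf (2)."""
    scale = sigma * math.sqrt(3.0) / math.pi  # sigma / g with g = pi / sqrt(3)
    total = 0.0
    for _ in range(n):
        u = rng.random()
        while u == 0.0:  # avoid log(0); rng.random() lies in [0, 1)
            u = rng.random()
        total += mu + scale * math.log(u / (1.0 - u))
    return total


def two_stage_select(mus, sigma, n1, n2, h, rng):
    """Run the two-stage procedure once; return (selected index, total sample size)."""
    k = len(mus)
    # Stage 1: n1 observations per population, then screen with the constant h.
    sums1 = [logistic_sum(mu, sigma, n1, rng) for mu in mus]
    means1 = [s / n1 for s in sums1]
    cutoff = max(means1) - h * sigma / math.sqrt(n1)
    retained = [i for i in range(k) if means1[i] >= cutoff]
    if len(retained) == 1:
        return retained[0], k * n1
    # Stage 2: n2 further observations from each retained population.
    cumulative = {}
    for i in retained:
        sum2 = logistic_sum(mus[i], sigma, n2, rng)
        cumulative[i] = (sums1[i] + sum2) / (n1 + n2)
    best = max(retained, key=lambda i: cumulative[i])
    return best, k * n1 + len(retained) * n2


rng = random.Random(0)
print(two_stage_select([0.0, 0.0, 5.0], 1.0, 10, 10, 1.0, rng))
```

Repeating `two_stage_select` many times under a fixed configuration of means gives Monte Carlo estimates of the probability of a correct selection and of the expected total sample size.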

There are an infinite number of combinations of $(n_1, n_2, h)$ for given $k$, $\delta$ and $P^*$
which will exactly guarantee the probability requirement given by (13), and different
design criteria lead to different choices. We will consider one of these criteria.

Let $S'$ denote the cardinality of the set $I$ in stage one and let

$$S = \begin{cases} 0, & \text{if } S' = 1 \\ S', & \text{if } S' > 1. \end{cases} \tag{14}$$

Then the total sample size required by $P_2$, TSS say, is

$$TSS = kn_1 + Sn_2.$$

Let $E_\mu[TSS|P_2]$ denote the expected total sample size for $P_2$ under $\mu$.

We adopt the following unrestricted minimax criterion to make a choice of $(n_1, n_2, h)$
as well as to keep the total sample size TSS small. For given $k$ and specified $\delta$ and
$P^*$, choose $(n_1, n_2, h)$ to

$$\text{minimize } \sup_{\mu \in \Omega} E_\mu[TSS|P_2]
\quad \text{subject to } \inf_{\mu \in \Omega(\delta)} P_\mu[CS|P_2] \ge P^*, \tag{15}$$

where $(n_1, n_2)$ are non-negative integers and $h > 0$.

For any population whose sample mean has the MLR property, Bhandari and
Chaudhuri (1987) proved that the least favorable configuration (LFC) of the
two-stage population means problem is a slippage configuration. However, the problem
of evaluating the exact probability of a correct selection in the LFC associated with
$P_2$ is complicated and still remains to be solved. Here we will consider lower bounds
for $P_\mu[CS|P_2]$ and construct conservative two-stage procedures.


3.2  Lower  bounds  for  the  probability  of  a  correct  selection 
for  P2 

In this section we derive lower bounds for $P_\mu[CS|P_2]$. These lower bounds will prove
to be particularly useful since we will prove that they achieve their infimum over
$\Omega(\delta)$ at $\mu(\delta)$, which has components

$$\mu = \mu_{[1]} = \cdots = \mu_{[k-1]} = \mu_{[k]} - \delta, \quad \delta > 0.$$

This result will permit us to construct a conservative two-stage procedure which
guarantees the probability requirement (13).

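The next theorem gives one of these lower bounds for $P_\mu[CS|P_2]$.

Theorem 3.1 For any $\mu \in \Omega$ we have

$$\begin{aligned}
P_\mu[CS|P_2] \ge{} & \int_{-\infty}^{\infty} \prod_{i=1}^{k-1} F_{n_1}(x + \delta_{ki}\sqrt{n_1}/\sigma + h)\,dF_{n_1}(x) \\
& + \int_{-\infty}^{\infty} \prod_{i=1}^{k-1} F_{n_1+n_2}(x + \delta_{ki}\sqrt{n_1 + n_2}/\sigma)\,dF_{n_1+n_2}(x) - 1,
\end{aligned} \tag{16}$$

where $F_n(x)$ is the cdf of the standardized sample means of size $n$ from $L(\mu, \sigma^2)$.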
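Proof

For any $\mu \in \Omega$ we have

$$\begin{aligned}
P_\mu[CS|P_2]
&= P_\mu[\bar{X}_{(k)}^{(1)} \ge \bar{X}_{[k]}^{(1)} - h\sigma/\sqrt{n_1},\ \bar{X}_{(k)} \ge \max_{i \in I} \bar{X}_{(i)}] \\
&\ge P_\mu[\bar{X}_{(k)}^{(1)} \ge \bar{X}_{(i)}^{(1)} - h\sigma/\sqrt{n_1},\ \bar{X}_{(k)} \ge \bar{X}_{(i)},\ \forall i \ne k] \\
&\ge P_\mu[\bar{X}_{(k)}^{(1)} \ge \bar{X}_{(i)}^{(1)} - h\sigma/\sqrt{n_1},\ \forall i \ne k]
 + P_\mu[\bar{X}_{(k)} \ge \bar{X}_{(i)},\ \forall i \ne k] - 1,
\end{aligned} \tag{17}$$

since $P(A \cap B) \ge P(A) + P(B) - 1$ for any two events $A$ and $B$. Then a straightforward
computation leads to the conclusion of this theorem. □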
Corollary 3.1 For all $\mu \in \Omega(\delta)$ we have

$$\begin{aligned}
\inf_{\mu \in \Omega(\delta)} P_\mu[CS|P_2] \ge{} & \int_{-\infty}^{\infty} \{F_{n_1}(x + \delta\sqrt{n_1}/\sigma + h)\}^{k-1}\,dF_{n_1}(x) \\
& + \int_{-\infty}^{\infty} \{F_{n_1+n_2}(x + \delta\sqrt{n_1 + n_2}/\sigma)\}^{k-1}\,dF_{n_1+n_2}(x) - 1.
\end{aligned} \tag{19}$$

Proof

The proof follows immediately on noting that the right hand side of (16) is
non-decreasing in each $\delta_{ki}$ for $i = 1, \ldots, k - 1$. □

Remark 3.1 Since the right hand side of (19) is strictly increasing in each of $n_1$,
$n_1 + n_2$ and $h$ and tends to one as $n_1$, or $n_2$ and $h$, tend to $\infty$, we see that the
probability requirement (13) can be guaranteed if one (or more) of these constants is
chosen sufficiently large.

Remark 3.2 If we let $h \to \infty$ on the right hand side of (16) we obtain

$$\int_{-\infty}^{\infty} \prod_{i=1}^{k-1} F_{n_1+n_2}(x + \delta_{ki}\sqrt{n_1 + n_2}/\sigma)\,dF_{n_1+n_2}(x),$$

which is an expression for $P_\mu[CS|P_1]$ where $P_1$ uses a common single-stage sample
size $n = n_1 + n_2$ per population. Thus $P_1$ is a special case of $P_2$ based on a conservative
lower bound and hence $E_\mu[TSS|P_2] \le kn$ for all $\mu \in \Omega$.

Remark 3.3 The distribution of the mean of samples from a logistic population has
the monotone likelihood ratio (MLR) property with respect to the location parameter
(Goel (1975)) and hence the distributions of the $\bar{X}_i^{(1)}$ and $\bar{X}_i^{(2)}$ are stochastically
increasing (SI) families in $\mu_i$, $i = 1, \ldots, k$.

Remark 3.4 The cumulative sample means

$$\bar{X}_i = \frac{n_1}{n_1 + n_2}\bar{X}_i^{(1)} + \frac{n_2}{n_1 + n_2}\bar{X}_i^{(2)}$$

are strictly increasing in each $\bar{X}_i^{(j)}$, $j = 1, 2$, $i = 1, \ldots, k$.


We can now find another lower bound to the $P_\mu[CS|P_2]$, given in the following
theorem, by noting the facts mentioned in Remark 3.3 and Remark 3.4. This lower
bound can be shown to be uniformly superior to the one given in Theorem 3.1. It is
also straightforward to determine the LFC of the population means relative to this
new lower bound.

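Theorem 3.2 We have

$$\begin{aligned}
\inf_{\mu \in \Omega(\delta)} P_\mu[CS|P_2] \ge{} & \int_{-\infty}^{\infty} \{F_{n_1}(x + \delta\sqrt{n_1}/\sigma + h)\}^{k-1}\,dF_{n_1}(x) \\
& \cdot \int_{-\infty}^{\infty} \{F_{n_1+n_2}(x + \delta\sqrt{n_1 + n_2}/\sigma)\}^{k-1}\,dF_{n_1+n_2}(x),
\end{aligned} \tag{20}$$

where $F_n(x)$ is the cdf of the standardized sample mean of size $n$ from $L(\mu, \sigma^2)$.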


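Proof

Let $F(\cdot|\mu_i)$ and $G(\cdot|\mu_i)$ denote the cdf's of the $\bar{X}_i^{(1)}$ and $\bar{X}_i$ respectively and let
$H(\cdot,\cdot|\mu_i)$ denote the joint cdf of the $\bar{X}_i^{(1)}$ and $\bar{X}_i$. Then $F(\cdot|\mu_i)$, $G(\cdot|\mu_i)$ and $H(\cdot,\cdot|\mu_i)$
are non-increasing in $\mu_i$, $i = 1, \ldots, k$, from Remark 3.3 and Remark 3.4. Without
loss of generality we may assume that $\mu_1 \le \cdots \le \mu_k$. Then for all $\mu \in \Omega(\delta)$,

$$\begin{aligned}
P_\mu[CS|P_2]
&= P_\mu[\bar{X}_{(k)}^{(1)} \ge \max_{1 \le i \le k} \bar{X}_{(i)}^{(1)} - h\sigma/\sqrt{n_1},\ \bar{X}_{(k)} = \max_{i \in I} \bar{X}_{(i)}] \\
&\ge P_\mu[\bar{X}_{(k)}^{(1)} \ge \bar{X}_{(j)}^{(1)} - h\sigma/\sqrt{n_1},\ \bar{X}_{(k)} \ge \bar{X}_{(j)},\ \forall j = 1, \ldots, k - 1] \\
&= \int_{-\infty}^{\infty}\int_{-\infty}^{\infty} \prod_{i=1}^{k-1} H(x + h\sigma/\sqrt{n_1}, y \mid \mu_i)\,dH(x, y \mid \mu_k) \\
&\ge \int_{-\infty}^{\infty}\int_{-\infty}^{\infty} H^{k-1}(x + h\sigma/\sqrt{n_1}, y \mid \mu_k - \delta)\,dH(x, y \mid \mu_k) \\
&= E_\mu[H^{k-1}\{\bar{X}_{(k)}^{(1)} + h\sigma/\sqrt{n_1}, \bar{X}_{(k)} \mid \mu_k - \delta\}],
\end{aligned}$$

where the expectation is with respect to the joint distribution of $\bar{X}_{(k)}^{(1)}$ and $\bar{X}_{(k)}$. Hence

$$\inf_{\mu \in \Omega(\delta)} P_\mu[CS|P_2] \ge \inf_{\mu \in \Omega(\delta)} E_\mu[H^{k-1}\{\bar{X}_{(k)}^{(1)} + h\sigma/\sqrt{n_1}, \bar{X}_{(k)} \mid \mu_k - \delta\}]$$

and it is enough to show that for all $\mu \in \Omega(\delta)$,

$$\begin{aligned}
& E_\mu[H^{k-1}\{\bar{X}_{(k)}^{(1)} + h\sigma/\sqrt{n_1}, \bar{X}_{(k)} \mid \mu_k - \delta\}] \\
& \quad \ge E_\mu[F^{k-1}\{\bar{X}_{(k)}^{(1)} + h\sigma/\sqrt{n_1} \mid \mu_k - \delta\}]\;E_\mu[G^{k-1}\{\bar{X}_{(k)} \mid \mu_k - \delta\}].
\end{aligned}$$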

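By Remark 3.4, for all $a$, $b$ and $\mu$,

$$\begin{aligned}
P_\mu\{\bar{X}_{(i)}^{(1)} \le a,\ \bar{X}_{(i)} \le b\}
&= P_\mu\Big\{\bar{X}_{(i)}^{(1)} \le a,\ \bar{X}_{(i)}^{(2)} \le \tfrac{n_1 + n_2}{n_2}\Big(b - \tfrac{n_1}{n_1 + n_2}\bar{X}_{(i)}^{(1)}\Big)\Big\} \\
&= E_\mu\Big[P_\mu\Big\{\bar{X}_{(i)}^{(1)} \le a,\ \bar{X}_{(i)}^{(2)} \le \tfrac{n_1 + n_2}{n_2}\Big(b - \tfrac{n_1}{n_1 + n_2}\bar{X}_{(i)}^{(1)}\Big) \,\Big|\, \bar{X}_{(i)}^{(1)}\Big\}\Big] \\
&\ge E_\mu\big[P_\mu\{\bar{X}_{(i)}^{(1)} \le a \mid \bar{X}_{(i)}^{(1)}\}\big]\;
 E_\mu\Big[P_\mu\Big\{\bar{X}_{(i)}^{(2)} \le \tfrac{n_1 + n_2}{n_2}\Big(b - \tfrac{n_1}{n_1 + n_2}\bar{X}_{(i)}^{(1)}\Big) \,\Big|\, \bar{X}_{(i)}^{(1)}\Big\}\Big] \\
&= P_\mu\{\bar{X}_{(i)}^{(1)} \le a\}\,P_\mu\{\bar{X}_{(i)} \le b\}
\end{aligned}$$

and hence

$$\begin{aligned}
E_\mu[H^{k-1}\{\bar{X}_{(k)}^{(1)} + h\sigma/\sqrt{n_1}, \bar{X}_{(k)} \mid \mu_k - \delta\}]
&\ge E_\mu[F^{k-1}\{\bar{X}_{(k)}^{(1)} + h\sigma/\sqrt{n_1} \mid \mu_k - \delta\} \cdot G^{k-1}\{\bar{X}_{(k)} \mid \mu_k - \delta\}] \\
&\ge E_\mu[F^{k-1}\{\bar{X}_{(k)}^{(1)} + h\sigma/\sqrt{n_1} \mid \mu_k - \delta\}] \cdot E_\mu[G^{k-1}\{\bar{X}_{(k)} \mid \mu_k - \delta\}]
\end{aligned}$$

by Chebyshev's inequality (Hardy, Littlewood and Pólya (1934)), since

$$F\{\bar{X}_{(k)}^{(1)} + h\sigma/\sqrt{n_1} \mid \mu_k - \delta\}$$

and

$$G\{\bar{X}_{(k)} \mid \mu_k - \delta\}$$

are non-decreasing in $\bar{X}_{(k)}^{(1)}$. □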

Remark 3.5 If we let

$$a = \int_{-\infty}^{\infty} \{F_{n_1}(x + \delta\sqrt{n_1}/\sigma + h)\}^{k-1}\,dF_{n_1}(x)$$

and

$$b = \int_{-\infty}^{\infty} \{F_{n_1+n_2}(x + \delta\sqrt{n_1 + n_2}/\sigma)\}^{k-1}\,dF_{n_1+n_2}(x),$$

then (19) states that

$$\inf_{\mu \in \Omega(\delta)} P_\mu[CS|P_2] \ge a + b - 1$$

and (20) states that

$$\inf_{\mu \in \Omega(\delta)} P_\mu[CS|P_2] \ge ab.$$

By noting that $a + b - 1 < ab$ for all $a, b \in (0, 1)$, because $ab - (a + b - 1) = (1 - a)(1 - b) > 0$,
the lower bound (20) is uniformly superior to the lower bound (19), and hence we will
use the lower bound (20) henceforth.
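The comparison of the two bounds can be illustrated numerically. The sketch below is a simplification and not the paper's computation: it replaces $F_n$ by the standard normal cdf, which is the leading term of the Edgeworth expansion, and integrates with a plain trapezoid rule. The function names and the design constants in the usage line are hypothetical.

```python
import math


def phi(z):
    """Standard normal pdf."""
    return math.exp(-0.5 * z * z) / math.sqrt(2.0 * math.pi)


def Phi(z):
    """Standard normal cdf, used here as a stand-in for F_n (an assumption)."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))


def integral_power(shift, k, lo=-10.0, hi=10.0, steps=4000):
    """Trapezoid approximation of the integral of Phi(x + shift)^(k-1) * phi(x)."""
    dx = (hi - lo) / steps
    total = 0.0
    for i in range(steps + 1):
        x = lo + i * dx
        weight = 0.5 if i in (0, steps) else 1.0
        total += weight * Phi(x + shift) ** (k - 1) * phi(x)
    return total * dx


def lower_bounds(k, delta_over_sigma, n1, n2, h):
    """Return (a + b - 1, a * b), the bounds (19) and (20) under the normal stand-in."""
    a = integral_power(delta_over_sigma * math.sqrt(n1) + h, k)
    b = integral_power(delta_over_sigma * math.sqrt(n1 + n2), k)
    return a + b - 1.0, a * b


print(lower_bounds(3, 0.5, 10, 10, 1.5))
```

For $k = 2$ the normal stand-in has the closed form $\Phi(c/\sqrt{2})$ for a shift $c$, which gives a quick check of the quadrature.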

3.3  Expected  total  sample  size  for  P2 

In order to solve the optimization problem (15) we first find an analytical expression
for the $E_\mu[TSS|P_2]$ and then determine the $\sup_{\mu \in \Omega} E_\mu[TSS|P_2]$ and the sets of $\mu_i$
values at which this supremum occurs.

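Theorem 3.3 For any $\mu \in \Omega$ we have

$$\begin{aligned}
E_\mu[TSS|P_2] = kn_1 + n_2 \sum_{i=1}^{k} \int_{-\infty}^{\infty} \Big\{
& \prod_{j=1,\, j \ne i}^{k} F_{n_1}(x + \delta_{ij}\sqrt{n_1}/\sigma + h) \\
& - \prod_{j=1,\, j \ne i}^{k} F_{n_1}(x + \delta_{ij}\sqrt{n_1}/\sigma - h)\Big\}\,dF_{n_1}(x),
\end{aligned} \tag{21}$$

where $F_n(x)$ is the cdf of the standardized sample means of size $n$ from $L(\mu, \sigma^2)$.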
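Proof

For any $\mu \in \Omega$ we have

$$E_\mu[TSS|P_2] = kn_1 + n_2 E_\mu[S|P_2],$$

where $S$ is defined as in (14). Now

$$\begin{aligned}
E_\mu[S|P_2] &= E_\mu[S'|P_2] - P_\mu[S' = 1|P_2] \\
&= \sum_{i=1}^{k} P_\mu[\bar{X}_{(i)}^{(1)} \ge \bar{X}_{(j)}^{(1)} - h\sigma/\sqrt{n_1},\ \forall j \ne i] \\
&\quad - \sum_{i=1}^{k} P_\mu[\bar{X}_{(i)}^{(1)} \ge \bar{X}_{(j)}^{(1)} + h\sigma/\sqrt{n_1},\ \forall j \ne i]
\end{aligned} \tag{22}$$

and hence Theorem 3.3 follows immediately. □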

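The following theorem summarizes the result concerning the supremum of the
$E_\mu[TSS|P_2]$ for $\mu \in \Omega$.

Theorem 3.4 For any $\mu \in \Omega$, fixed $k$ and $(n_1, n_2, h)$ we have

$$\sup_{\mu \in \Omega} E_\mu[TSS|P_2]
= kn_1 + kn_2 \int_{-\infty}^{\infty} [\{F_{n_1}(x + h)\}^{k-1} - \{F_{n_1}(x - h)\}^{k-1}]\,dF_{n_1}(x), \tag{23}$$

which occurs when $\mu_{[1]} = \cdots = \mu_{[k]}$, where $F_n(x)$ is the cdf of the standardized sample
means of size $n$ from $L(\mu, \sigma^2)$. The factor $k$ multiplying $n_2$ arises because, with all
$\delta_{ij} = 0$, the $k$ summands in (21) are identical.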

Proof

Noting Remark 3.3 and Remark 3.4 we can use the results of Gupta (1965), which
show that $E_\mu[S'|P_2]$ achieves its supremum for $\mu \in \Omega$ when $\mu_{[1]} = \cdots = \mu_{[k]}$. By a
similar argument $P_\mu[S' = 1|P_2]$ achieves its infimum when $\mu_{[1]} = \cdots = \mu_{[k]}$. Hence
the result follows immediately from Theorem 3.3. □
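The supremum in Theorem 3.4 can be evaluated numerically under the order-$n^{-3}$ Edgeworth approximation of Section 2.2. The sketch below is not the authors' Fortran program: function names are illustrative, the quadrature is a plain trapezoid rule, and the supremum is taken as $kn_1 + kn_2$ times the integral, that is (21) evaluated with all $\delta_{ij} = 0$. Non-integer sample sizes are accepted so that the continuous solutions listed in the tables can be plugged in.

```python
import math


def hermite(j, x):
    """Hermite polynomial H_j(x) from the recurrence H_j = x*H_{j-1} - (j-1)*H_{j-2}."""
    if j == 0:
        return 1.0
    h_prev, h = 1.0, x
    for m in range(2, j + 1):
        h_prev, h = h, x * h - (m - 1) * h_prev
    return h


A4 = (6 / 5) / 24        # lambda_4 / 4!
A6 = (48 / 7) / 720      # lambda_6 / 6!
A8 = (432 / 5) / 40320   # lambda_8 / 8!


def phi(z):
    return math.exp(-0.5 * z * z) / math.sqrt(2.0 * math.pi)


def Phi(z):
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))


def F(z, n):
    """Edgeworth cdf of the standardized logistic mean, following (12)."""
    t1 = A4 * hermite(3, z)
    t2 = A6 * hermite(5, z) + 0.5 * A4 ** 2 * hermite(7, z)
    t3 = A8 * hermite(7, z) + A4 * A6 * hermite(9, z) + A4 ** 3 / 6 * hermite(11, z)
    return Phi(z) - phi(z) * (t1 / n + t2 / n ** 2 + t3 / n ** 3)


def f(z, n):
    """Edgeworth pdf of the standardized logistic mean, following (11)."""
    t1 = A4 * hermite(4, z)
    t2 = A6 * hermite(6, z) + 0.5 * A4 ** 2 * hermite(8, z)
    t3 = A8 * hermite(8, z) + A4 * A6 * hermite(10, z) + A4 ** 3 / 6 * hermite(12, z)
    return phi(z) * (1.0 + t1 / n + t2 / n ** 2 + t3 / n ** 3)


def sup_etss(k, n1, n2, h, lo=-9.0, hi=9.0, steps=3600):
    """Supremum of the expected total sample size at mu_[1] = ... = mu_[k]."""
    dx = (hi - lo) / steps
    total = 0.0
    for i in range(steps + 1):
        x = lo + i * dx
        weight = 0.5 if i in (0, steps) else 1.0
        total += weight * (F(x + h, n1) ** (k - 1) - F(x - h, n1) ** (k - 1)) * f(x, n1)
    return k * n1 + k * n2 * total * dx


# Continuous solution listed in Table 2 for k = 2, delta/sigma = 0.50.
print(sup_etss(2, 7.013, 6.404, 2.591))
```

For that Table 2 entry the sketch should land close to the tabulated ETSS of 0.259726e+02; small differences from quadrature and rounding of the inputs are to be expected.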

3.4  Optimization  problem  yielding  conservative  solutions 

In this section we consider the optimization problem (15) which one must solve in
order to determine the constants $(n_1, n_2, h)$ which are necessary to implement $P_2$.
As we noted earlier, the problem of evaluating the exact probability of a correct
selection in the LFC associated with $P_2$ is very complicated. Thus we replace the
exact $\inf_{\mu \in \Omega(\delta)} P_\mu[CS|P_2]$ by the conservative lower bound given by the right hand
side of (20), and consider the following optimization problem.

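For the given $k$, $\delta$ and $P^*$ choose the constants $(n_1, n_2, h)$ to

$$\text{minimize } kn_1 + kn_2 \int_{-\infty}^{\infty} [\{F_{n_1}(x + h)\}^{k-1} - \{F_{n_1}(x - h)\}^{k-1}]\,dF_{n_1}(x)$$

subject to

$$\int_{-\infty}^{\infty} \{F_{n_1}(x + \delta\sqrt{n_1}/\sigma + h)\}^{k-1}\,dF_{n_1}(x)
\cdot \int_{-\infty}^{\infty} \{F_{n_1+n_2}(x + \delta\sqrt{n_1 + n_2}/\sigma)\}^{k-1}\,dF_{n_1+n_2}(x) \ge P^*, \tag{24}$$

where $n_1$ and $n_2$ are non-negative integers and $h > 0$.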

Let us denote by $(\hat{n}_1, \hat{n}_2, \hat{h})$ the solution to the optimization problem (24). Then
we can use the approximate design constants

$$n_1 = [\hat{n}_1 + 1], \quad n_2 = [\hat{n}_2 + 1], \quad h = \hat{h},$$

where $[z]$ denotes the greatest integer which is less than $z$, to implement $P_2$.
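As a small worked example of this rounding rule, the sketch below applies it to the continuous solution listed in Table 2 for $k = 2$ and $\delta/\sigma = 0.50$. The helper names are hypothetical; "greatest integer less than $z$" is read as strictly less, so an integer-valued $\hat{n}$ is left unchanged.

```python
import math


def greatest_integer_below(z):
    """Greatest integer strictly less than z."""
    return math.ceil(z) - 1


def design_constants(n1_hat, n2_hat, h_hat):
    """Apply n1 = [n1_hat + 1], n2 = [n2_hat + 1], h = h_hat."""
    return (greatest_integer_below(n1_hat + 1),
            greatest_integer_below(n2_hat + 1),
            h_hat)


print(design_constants(7.013, 6.404, 2.591))  # (8, 7, 2.591)
```

The rule therefore rounds each continuous sample size up to the next integer, which keeps the rounded design on the conservative side of the constraint in (24) with respect to $n_1$ and $n_2$.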

Table 1, Table 2, Table 3 and Table 4 contain the constants $(\hat{n}_1, \hat{n}_2, \hat{h})$ necessary
to approximate $(n_1, n_2, h)$ and the values of the expected total sample size (ETSS)
for $k = 2, 3, 4, 5, 10, 15$, $P^* = 0.75, 0.90, 0.95, 0.99$ and $\delta/\sigma = 0.1, 0.5, 1.0, 2.0, 4.0$. All
computations were carried out in double-precision arithmetic on a VAX-11/780. The
SUMT (Sequential Unconstrained Minimization Techniques: Fiacco and McCormick
(1968)) algorithm is used to solve the non-linear optimization problem. A source
program in Fortran for the SUMT algorithm is given by Kuester and Mize (1973).


3.5  The  performance  of  the  two-stage  procedure  relative  to 
the  single-stage  procedure 

As a measure of efficiency of the two-stage procedure $P_2$ relative to that of the
single-stage procedure $P_1$ when both guarantee the same basic probability requirement
(13), we consider the ratio termed relative efficiency (RE), $E_\mu[TSS|P_2]/(k\hat{n}_s)$, where $\hat{n}_s$
is the estimate of the minimum sample size $n_s$ needed in the single-stage procedure
$P_1$. Clearly RE depends on $\delta$ and $P^*$. Values of the RE less than unity favor $P_2$
over $P_1$.

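Now the RE is given by

$$\begin{aligned}
RE = \frac{1}{k\hat{n}_s}\Big[k\hat{n}_1 + \hat{n}_2 \sum_{i=1}^{k} \int_{-\infty}^{\infty} \Big\{
& \prod_{j=1,\, j \ne i}^{k} F_{\hat{n}_1}(x + \delta_{ij}\sqrt{\hat{n}_1}/\sigma + \hat{h}) \\
& - \prod_{j=1,\, j \ne i}^{k} F_{\hat{n}_1}(x + \delta_{ij}\sqrt{\hat{n}_1}/\sigma - \hat{h})\Big\}\,dF_{\hat{n}_1}(x)\Big],
\end{aligned} \tag{25}$$

where $\hat{n}_s$ is the solution of

$$\int_{-\infty}^{\infty} \{F_{\hat{n}_s}(t + (\sqrt{\hat{n}_s}/\sigma)\delta)\}^{k-1}\,dF_{\hat{n}_s}(t) = P^*. \tag{26}$$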
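Equation (26) can be solved for the single-stage sample size by bisection, since its left hand side increases with the sample size. The sketch below is a simplification: it uses the standard normal cdf as a stand-in for $F_n$ (an assumption, not the Edgeworth approximation used for the tables), and all function names are hypothetical.

```python
import math


def phi(z):
    return math.exp(-0.5 * z * z) / math.sqrt(2.0 * math.pi)


def Phi(z):
    """Standard normal cdf, standing in for F_n (an assumption)."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))


def pcs_single_stage(n, k, delta_over_sigma, lo=-10.0, hi=10.0, steps=2000):
    """Left hand side of (26) under the normal stand-in, by the trapezoid rule."""
    shift = math.sqrt(n) * delta_over_sigma
    dx = (hi - lo) / steps
    total = 0.0
    for i in range(steps + 1):
        t = lo + i * dx
        weight = 0.5 if i in (0, steps) else 1.0
        total += weight * Phi(t + shift) ** (k - 1) * phi(t)
    return total * dx


def solve_single_stage_n(k, delta_over_sigma, p_star, lo=1e-9, hi=1e6, iters=60):
    """Bisection on n so that pcs_single_stage(n) = p_star."""
    for _ in range(iters):
        mid = 0.5 * (lo + hi)
        if pcs_single_stage(mid, k, delta_over_sigma) < p_star:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


n_s = solve_single_stage_n(2, 0.5, 0.90)
print(n_s)
```

Dividing a two-stage ETSS by $k$ times this value gives a rough relative efficiency; because of the normal stand-in it need not agree exactly with the values reported for the logistic case.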

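We consider the relative efficiency for two special cases, namely, the equally spaced
and the slippage configurations. First, for the equally spaced configuration, we assume
that the unknown means of the $k$ populations are $\mu, \mu + \delta, \ldots, \mu + (k - 1)\delta$,
which have ranks $1, 2, \ldots, k$, respectively. Let $RE_{eq}$ denote the relative efficiency
with respect to the above configuration. Then, since $\delta_{ij} = \mu_{[i]} - \mu_{[j]} = (i - j)\delta$,

$$\begin{aligned}
RE_{eq} = \frac{1}{k\hat{n}_s}\Big[k\hat{n}_1 + \hat{n}_2 \sum_{i=1}^{k} \int_{-\infty}^{\infty} \Big\{
& \prod_{j=1,\, j \ne i}^{k} F_{\hat{n}_1}(t + \sqrt{\hat{n}_1}(i - j)\delta/\sigma + \hat{h}) \\
& - \prod_{j=1,\, j \ne i}^{k} F_{\hat{n}_1}(t + \sqrt{\hat{n}_1}(i - j)\delta/\sigma - \hat{h})\Big\}\,dF_{\hat{n}_1}(t)\Big].
\end{aligned} \tag{27}$$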

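Next, for the slippage configuration, we assume that the unknown means of the $k$
populations are $\mu_{[j]} = \mu$, $j = 1, \ldots, k - 1$, and $\mu_{[k]} = \mu + \delta$, $\delta > 0$. Then the relative
efficiency with respect to the above configuration, $RE_{sp}$, is obtained by substituting
$\delta_{ij} = 0$ for $i, j < k$, $\delta_{kj} = \delta$ and $\delta_{ik} = -\delta$ in (25):

$$\begin{aligned}
RE_{sp} = \frac{1}{k\hat{n}_s}\Big[k\hat{n}_1 + \hat{n}_2\Big\{
& (k - 1)\int_{-\infty}^{\infty} \big[\{F_{\hat{n}_1}(t + \hat{h})\}^{k-2} F_{\hat{n}_1}(t - \sqrt{\hat{n}_1}\delta/\sigma + \hat{h}) \\
& \qquad\qquad - \{F_{\hat{n}_1}(t - \hat{h})\}^{k-2} F_{\hat{n}_1}(t - \sqrt{\hat{n}_1}\delta/\sigma - \hat{h})\big]\,dF_{\hat{n}_1}(t) \\
& + \int_{-\infty}^{\infty} \big[\{F_{\hat{n}_1}(t + \sqrt{\hat{n}_1}\delta/\sigma + \hat{h})\}^{k-1}
 - \{F_{\hat{n}_1}(t + \sqrt{\hat{n}_1}\delta/\sigma - \hat{h})\}^{k-1}\big]\,dF_{\hat{n}_1}(t)\Big\}\Big].
\end{aligned} \tag{28}$$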

Table 5 and Table 6 give the values of the $RE_{eq}$ and $RE_{sp}$ for given values of
$P^* = 0.75, 0.90, 0.95, 0.99$, $k = 2, 3, 4, 5, 10, 15$ and $\delta/\sigma = 0.1, 0.5, 1.0, 2.0, 4.0$.

For any values of $P^*$, $k$ and $\delta$, $RE_{eq} < 1$ and $RE_{sp} < 1$, and hence the two-stage
procedure is more efficient than the single-stage procedure in terms of the expected
total sample sizes. Furthermore, the effectiveness of $P_2$ appears to be increasing in $k$
since the values of $RE_{eq}$ and $RE_{sp}$ are decreasing in $k$.


Table 1: Constants to implement the two-stage procedure P2 for selecting the largest
logistic population: P* = 0.75.

  k    δ/σ    n̂1            n̂2            ĥ             ETSS
  2    0.10   0.4548e+02    0.4589e+02    0.5530e+01    0.181720e+03
       0.50   0.2620e+01    0.8929e+00    0.7323e+01    0.702648e+01
       1.00   0.3121e+00    0.4983e+00    0.7097e+01    0.162056e+01
       2.00   0.9656e-01    0.1070e+00    0.6698e+01    0.406153e+00
       4.00   0.2834e-01    0.2232e-01    0.6127e+01    0.101300e+00
  3    0.10   0.1018e+03    0.1044e+03    0.3688e+01    0.616938e+03
       0.50   0.4778e+01    0.3340e+01    0.4516e+01    0.243344e+02
       1.00   0.8971e+00    0.1050e+01    0.7309e+01    0.584085e+01
       2.00   0.1876e+00    0.2774e+00    0.6606e+01    0.139489e+01
       4.00   0.8039e-01    0.3586e-01    0.6112e+01    0.348738e+00
  4    0.10   0.1392e+03    0.1515e+03    0.2751e+01    0.112241e+04
       0.50   0.5900e+01    0.5554e+01    0.2947e+01    0.447121e+02
       1.00   0.1711e+01    0.1037e+01    0.4169e+01    0.109639e+02
       2.00   0.3256e+00    0.3270e+00    0.5857e+01    0.260899e+01
       4.00   0.8061e-01    0.8252e-01    0.5737e+01    0.652252e+00
  5    0.10   0.1631e+03    0.2013e+03    0.2278e+01    0.166485e+04
       0.50   0.6766e+01    0.7657e+01    0.2341e+01    0.666234e+02
       1.00   0.1826e+01    0.1630e+01    0.2746e+01    0.166961e+02
       2.00   0.3864e+00    0.4182e+00    0.4050e+01    0.399414e+01
       4.00   0.9599e-01    0.1051e+00    0.4087e+01    0.998538e+00
 10    0.10   0.2357e+03    0.4304e+03    0.1494e+01    0.461824e+04
       0.50   0.9587e+01    0.1738e+02    0.1468e+01    0.181456e+03
       1.00   0.2504e+01    0.4455e+01    0.1398e+01    0.458878e+02
       2.00   0.6367e+00    0.1178e+01    0.1361e+01    0.117515e+02
       4.00   0.1582e+00    0.2758e+00    0.1470e+01    0.295260e+01
 15    0.10   0.2714e+03    0.5855e+03    0.1369e+01    0.744887e+04
       0.50   0.1100e+02    0.2372e+02    0.1352e+01    0.299197e+03
       1.00   0.2858e+01    0.6119e+01    0.1308e+01    0.757369e+02
       2.00   0.7466e+00    0.1668e+01    0.1255e+01    0.195906e+02
       4.00   0.1906e+00    0.4032e+00    0.1313e+01    0.501004e+01


Table 2: Constants to implement the two-stage procedure P2 for selecting the largest
logistic population: P* = 0.90.

  k    δ/σ    n̂1            n̂2            ĥ             ETSS
  2    0.10   0.1668e+03    0.1728e+03    0.2446e+01    0.650194e+03
       0.50   0.7013e+01    0.6404e+01    0.2591e+01    0.259726e+02
       1.00   0.1932e+01    0.1311e+01    0.3369e+01    0.643201e+01
       2.00   0.4011e+00    0.3724e+00    0.5331e+01    0.154620e+01
       4.00   0.1044e+00    0.8907e-01    0.5026e+01    0.386564e+00
  3    0.10   0.2745e+03    0.251Se+03    0.2017e+01    0.1461S2e+04
       0.50   0.1126e+02    0.9634e+01    0.2071e+01    0.585665e+02
       1.00   0.2971e+01    0.2135e+01    0.2332e+01    0.146860e+02
       2.00   0.6894e+00    0.5189e+00    0.6004e+01    0.362197e+01
       4.00   0.1693e+00    0.1310e+00    0.4955e+01    0.900049e+00
  4    0.10   0.3298e+03    0.3318e+03    0.1713e+01    0.229940e+04
       0.50   0.1340e+02    0.1300e+02    0.1728e+01    0.922982e+02
       1.00   0.3489e+01    0.3048e+01    0.1796e+01    0.232917e+02
       2.00   0.8374e+00    0.7008e+00    0.2643e+01    0.592662e+01
       4.00   0.2090e+00    0.1704e+00    0.2831e+01    0.147462e+01
  5    0.10   0.3664e+03    0.4034e+03    0.1556e+01    0.315013e+04
       0.50   0.1488e+02    0.1596e+02    0.1553e+01    0.126542e+03
       1.00   0.3863e+01    0.3858e+01    0.1559e+01    0.320185e+02
       2.00   0.9610e+00    0.9217e+00    0.1867e+01    0.829954e+01
       4.00   0.2403e+00    0.2184e+00    0.2071e+01    0.208230e+01
 10    0.10   0.4549e+03    0.6465e+03    0.1367e+01    0.750100e+04
       0.50   0.1844e+02    0.2588e+02    0.1357e+01    0.301614e+03
       1.00   0.4784e+01    0.6497e+01    0.1328e+01    0.765662e+02
       2.00   0.1328e+01    0.1644e+01    0.1257e+01    0.201395e+02
       4.00   0.3335e+00    0.4262e+00    0.1362e+01    0.528481e+01
 15    0.10   0.4934e+03    0.7911e+03    0.1366e+01    0.119540e+05
       0.50   0.1999e+02    0.3177e+02    0.1358e+01    0.480822e+03
       1.00   0.5180e+01    0.8022e+01    0.1335e+01    0.122187e+03
       2.00   0.143Se+01    0.2074e+01    0.1280e+01    0.322460e+02
       4.00   0.3751e+00    0.5593e+00    0.1328e+01    0.865274e+01


Table 3: Constants to implement the two-stage procedure P2 for selecting the largest
logistic population: P* = 0.95.

  k    δ/σ    n̂1            n̂2            ĥ             ETSS
  2    0.10   0.3008e+03    0.2827e+03    0.1781e+01    0.104953e+04
       0.50   0.1227e+02    0.1098e+02    0.1810e+01    0.421247e+02
       1.00   0.3216e+01    0.2504e+01    0.1968e+01    0.106222e+02
       2.00   0.7631e+00    0.5883e+00    0.3556e+01    0.268233e+01
       4.00   0.1899e+00    0.1457e+00    0.3785e+01    0.667647e+00
  3    0.10   0.4362e+03    0.3667e+03    0.1574e+01    0.211419e+04
       0.50   0.1768e+02    0.1436e+02    0.1589e+01    0.849214e+02
       1.00   0.4579e+01    0.3388e+01    0.1664e+01    0.214801e+02
       2.00   0.1223e+01    0.6952e+00    0.2269e+01    0.563339e+01
       4.00   0.2858e+00    0.1853e+00    0.3237e+01    0.139794e+01
  4    0.10   0.4991e+03    0.4519e+03    0.1452e+01    0.318364e+04
       0.50   0.2023e+02    0.1787e+02    0.1453e+01    0.127954e+03
       1.00   0.5232e+01    0.4325e+01    0.1464e+01    0.324315e+02
       2.00   0.1420e+01    0.9417e+00    0.1676e+01    0.846044e+01
       4.00   0.3393e+00    0.2423e+00    0.2163e+01    0.218343e+01
  5    0.10   0.5381e+03    0.5269e+03    0.1392e+01    0.426098e+04
       0.50   0.2182e+02    0.2088e+02    0.1388e+01    0.171314e+03
       1.00   0.5649e+01    0.5112e+01    0.1379e+01    0.434710e+02
       2.00   0.1546e+01    0.1182e+01    0.1430e+01    0.114045e+02
       4.00   0.3809e+00    0.3002e+00    0.1751e+01    0.299628e+01
 10    0.10   0.6279e+03    0.7682e+03    0.1349e+01    0.973702e+04
       0.50   0.2544e+02    0.3070e+02    0.1342e+01    0.391770e+03
       1.00   0.6692e+01    0.7667e+01    0.1321e+01    0.996400e+02
       2.00   0.1827e+01    0.1923e+01    0.1269e+01    0.263641e+02
       4.00   0.4897e+00    0.5216e+00    0.1344e+01    0.72498Se+01
 15    0.10   0.6674e+03    0.9126e+03    0.1377e+01    0.153152e+05
       0.50   0.2703e+02    0.3659e+02    0.1370e+01    0.616396e+03
       1.00   0.7002e+01    0.9178e+01    0.1354e+01    0.156917e+03
       2.00   0.1942e+01    0.2339e+01    0.1310e+01    0.416523e+02
       4.00   0.5293e+00    0.6784e+00    0.1300e+01    0.115109e+02

Table 4: Constants to implement the two-stage procedure P2 for selecting the largest
logistic population: P* = 0.99.

  k    δ/σ    n̂1            n̂2            ĥ             ETSS
  2    0.10   0.6892e+03    0.5071e+03    0.1295e+01    0.202774e+04
       0.50   0.2784e+02    0.2014e+02    0.1300e+01    0.815644e+02
       1.00   0.7189e+01    0.4895e+01    0.1309e+01    0.207286e+02
       2.00   0.1966e+01    0.1107e+01    0.1414e+01    0.546768e+01
       4.00   0.5013e+00    0.2771e+00    0.2143e+01    0.148773e+01
  3    0.10   0.8588e+03    0.5804e+03    0.1248e+01    0.366047e+04
       0.50   0.3475e+02    0.2299e+02    0.1249e+01    0.147249e+03
       1.00   0.8954e+01    0.5614e+01    0.1254e+01    0.374304e+02
       2.00   0.2442e+01    0.1300e+01    0.1314e+01    0.988866e+01
       4.00   0.6518e+00    0.3209e+00    0.2052e+01    0.278376e+01
  4    0.10   0.9268e+03    0.6663e+03    0.1254e+01    0.526885e+04
       0.50   0.3750e+02    0.2646e+02    0.1253e+01    0.211992e+03
       1.00   0.9668e+01    0.6501e+01    0.1250e+01    0.539214e+02
       2.00   0.2647e+01    0.1542e+01    0.1266e+01    0.142783e+02
       4.00   0.7350e+00    0.4650e+00    0.1230e+01    0.404965e+01
  5    0.10   0.9639e+03    0.7432e+03    0.1271e+01    0.687580e+04
       0.50   0.3903e+02    0.2955e+02    0.1266e+01    0.276700e+03
       1.00   0.1008e+02    0.7283e+01    0.1268e+01    0.704196e+02
       2.00   0.2770e+01    0.1761e+01    0.1252e+01    0.186821e+02
       4.00   0.7668e+00    0.4714e+00    0.1451e+01    0.532902e+01
 10    0.10   0.1049e+04    0.9971e+03    0.1343e+01    0.149675e+05
       0.50   0.4246e+02    0.3976e+02    0.1340e+01    0.602284e+03
       1.00   0.1099e+02    0.9879e+01    0.1327e+01    0.153541e+03
       2.00   0.3045e+01    0.2443e+01    0.1295e+01    0.409579e+02
       4.00   0.8789e+00    0.6787e+00    0.1394e+01    0.119728e+02
 15    0.10   0.1088e+04    0.1147e+04    0.1400e+01    0.231194e+05
       0.50   0.4405e+02    0.4583e+02    0.1396e+01    0.931205e+03
       1.00   0.1140e+02    0.1143e+02    0.1384e+01    0.237593e+03
       2.00   0.3166e+01    0.2855e+01    0.1356e+01    0.636515e+02
       4.00   0.9277e+00    0.8022e+00    0.1455e+01    0.187987e+02

Table  6:  Relative  efficiency  of  the  two-stage  procedure  P2:  Slippage  configuration. 